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Abstract 

We consider the problem of coding for quantum channels with side information that is 
available ahead of time at the transmitter but not at the receiver. We find a single-letter expres- 
sion for the entanglement-assisted quantum capacity of such channels which closely parallels 
Gel'fand and Pinsker's solution to the classical version of the same problem. This theorem can 
also be used to find a lower bound on the unassisted quantum capacity of these channels. 

1 Introduction 

Consider the following problem: we have a noisy quantum memory device that can store n qubits 
and which contains a certain fraction of defective cells. The cells that do work can be modelled 
as a depolarizing channel, but the defective ones always output |0). We can test which cells are 
defective before writing to the memory device, but this information is not necessarily available 
when reading from it. What is the best asymptotic rate at which we can store qubits reliably on 
this device? This problem can be generalized to any channel where the transmitter has access to 
side information about the channel state while the receiver does not. 

The corresponding classical problem has been solved by Gel'fand and Pinsker in [GP80]. They 
consider channels modelled as a conditional probability distribution PY\xs{y\xj s)f x e X,s e 
S,y e y, where x, y and s represent the input, output and state of the channel respectively. The 
channel state is i.i.d. and distributed according to ps{s)- The encoder has access to the entire 
sequence of channel states ahead of time whereas the decoder does not. They have shown that the 
capacity of such a channel is given by 

C= max [I{U;Y)- I{U]S)] (1) 

lusx'^'P 

where V is the set of all probability distributions onU x X x S such that the marginal on S is equal 
to ps(s);Z^ is an arbitrary set that can be chosen such that \U\ < jA'l + liSI. The mutual informations 
are computed on the distribution • qusx- 

Here we shall generalize this result to quantum channels. Namely, we will prove that the 
entanglement-assisted quantum capacity of quantum channels with side information at the trans- 
mitter has the same form as (1) and, a relatively rare fact in quantum information theory, has a 
single-letter converse. 



Many other quantum information processing tasks involving side information or multiple 
users have been considered in the literature. For instance, data compression with side information 
at the receiver (generally known as Slepian-Wolf coding [SW71]) was generalized to the quantum 
world in [DW03] in the case when only the side information is quantum, and in [ADHW06] when 
both the data and the side information are quantum. A slightly different generalization of Slepian- 
Wolf coding called state merging was presented in [HOW06]; in this case, the side information and 
the data to be sent are both quantum, but transmission is achieved through entanglement and 
classical communication. Classical channel simulation with quantum side information at the re- 
ceiver is considered in [LD06]. Another primitive called quantum state redistribution [DY06] allows 
one to send the C subsystem of a mixed state p^^'-^ from Alice (who initially holds both A and C) 
to Bob who already holds B. Quantum generalizations of broadcast channels [YHD06, DHL07] 
and multiple access channels [HDW05, WinOl, KliOl] have also been considered. 

Our result is very much in the same spirit as those in [DHW03], [HOW06], and [ADHW06]. 
In [DHW03], it is shown that a large number of quantum information protocols, such as one-way 
entanglement distillation [DW05], entanglement-assisted channel coding [Llo96, Sho02, Dev05], 
channel simulation [BDH+06a] and many others can be derived by simple transformations from 
two basic protocols, called the mother and father protocols. In [HOW06], the authors analyze the 
effect of making random measurements on a state and show that this leads to the state merging 
protocol, which can be used to derive several additional protocols. In [ADHW06], a fully quantum 
version of state merging is presented, called the "fully quantum Slepian-Wolf" protocol (FQSW) 
from which all other protocols mentioned (including the father, the mother, state merging, and 
state redistribution [Opp08]) can be derived. This generality is not surprising: the state merging 
and FQSW papers essentially consist of an analysis of the power of randomly selected unitary 
transformations, which can be viewed as the quantum generalization of random codes. We will 
use this approach here for our direct coding theorem, replacing Gel'fand and Pinsker's random 
binning argument by a random unitary, as in the FQSW theorem. 

In the next section, we will introduce our notation, define precisely what a quantum channel 
with side information at the transmitter is, and give appropriate definitions of achievable rate and 
capacity. Section 3 will consist of a quick review of the FQSW protocol. Section 4 will be devoted 
to the direct coding theorem itself, with the single-letter converse given in section 5. 

2 Notation, definitions and background material 

Quantum subsystems will be denoted by capital letters A, B, and so on; we will use superscripts 
on states to indicate which subsystems they are defined on. Given a ket we will use the 
shorthand ifj = |^) (^| for its associated density operator. For any density operator, we will denote 
its partial trace on one subsystem by removing that subsystem from the superscript, i.e. = 
Tr^ p"^^ . Furthermore, we will use the notation A^ to denote the tensor product of n copies of the 
system A. 

For operations on quantum states, including unitaries, isometries and superoperators, we use 
a similar convention to denote the input and output spaces; for example, J\f^^^ will denote a 
superoperator from A to B. Superscripts will be omitted when doing so is not likely to cause 
confusion. The dimension of a system A is denoted \A\. 

A superoperator can always be extended to an isometry by adding another output subsystem 
representing the environment. The operation performed by this extension is exactly the same 
as the original channel if we trace out the environment system. We will denote the isometric 
extension of by U^~^^^ . Here C/\r does not act on density operators but on the Hilbert 
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space itself. 

To make the notation less cumbersome, we will use the symbol • to denote conjugation; i.e. 

A-B := ABAK 

We will also define |<I>)'^"^' := — ^ X]!=o N)^^ ' where the and are some standard 

orthonormal bases on 5 and S'. 

We will often use the trace norm, which we define to be ||A/||i := Tr |Af| for any Hermitian 
matrix M. This norm induces a metric — cr ||i on the space of quantum states. 

The von Neumann entropy of a density operator will be denoted H{A)p; when p is defined 
on more than one subsystem, H{A)p denotes the entropy of its restriction to A. The quantum 
mutual information is the function I{A; B)p = H{A)p + H{B)p — H{AB)p, where p is some state 
on systems A and B (and possibly more subsystems). 

We will say that two families of states i/' arid tp parametrized by their size n are asymptotically 
equal (denoted ij: 99) if 11-0 — '/'Hi goes to zero as n 00. See Appendix A for a formal 
definition. 

Finally, we will often make use of a version of Uhlmann's theorem [Uhl76] shown in [DHW05], 
lemma 2.2: whenever -0"^ and (f^ are density matrices such that H-i/;^ — ^ £, for any pair of 

purifications il)^^ and y?^*^ of i/j^ and (p^ respectively, there exists a partial isometry V'^~'^ such 
that Wip^^ — V ■ f^^Wi ^ 2y/e. In particular, whenever we have two families of states ip^ and ip^ 
such that V'^ ~(a) f"^' then for any family of purifications il)^^ and 93"^^ there exists a family of 
partial isometries V'~"'~*^ such that ip^^ ^ " f^'^^ ■ 



2.1 Definition of quantum cliannels witli side information at tlie transmitter 

A quantum channel with side information at the transmitter is defined by a superoperator AA"^'"^^^ 
and a quantum state IV')'^'^ ; this quantum state represents the side information. Alice has access 
to 5' and can input a state of her choice into A'. One way to view this is to say that Alice shares 
entanglement with the channel itself. This framework allows us to consider both quantum and 
classical side information about the channel in a unified manner. 

To illustrate this, consider the example of the depolarizing channel with defects given in the 
introduction. For this case, we can choose 1-0) to be ^/p\00) + a/1 — p\ll). The superoperator 
then measures the 5 subsystem, and outputs 1 0) if the outcome is 0. If the outcome is 1, it applies 
the depolarizing channel to A' and sends the output to Bob. 

In this paper, we will be mostly interested in the entanglement-assisted quantum capacity of 
such channels. 

We will now define precisely the notions of code, achievable rate, and capacity for quantum 
channels with side information at the transmitter. Even though the main concern of the paper is 
the entanglement-assisted capacity, we are nonetheless interested in the amount of entanglement 
consumed by our protocol. We will therefore consider protocols in which some of the entangle- 
ment used during the execution of the protocol is returned at the end, since doing this improves 
the entanglement consumption rate of the protocol. 

Definition 1. A {Q, n, e)-code for a quantum channel with side information at the transmitter {Af, \\p)) 
consists of an encoding superoperator gR'AS'"^A'"A ^ decoding superoperator dB'^b-^bb g^^j^ ^^^^ 

mM{£{ipin)))-^out\\i^e (2) 

where (/?in = <S> <S> {ip^^'Y, (font = ® ^^^^ , and log \R\ = log \R'\ = nQ. 
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It will turn out to be much more convenient for us to use isometric extensions of the encod- 
ing and decoding superoperators. We will generally use "~^a "AD ^.^ ^^gj-^ote the isometric 
extension of the encoding map £ and y^'^'^^BBG £qj. ^j^g decoding map V. 

Definition 2. We say that Q is an achievable rate for the channel {Af, if there exists a sequence of 
{Q, n, en)-codes such that e„ ^ as n ^ oo. 

In other words, Q is achievable if there exists a family of codes as defined above such that 

V-U^^'-W- (^in «(a) ^out (3) 

The capacity of a channel {M, \ ^^J)) is the supremum of all achievable rates. 
The goal of this paper is to establish the following theorem: 

channel with side information at 

(4) 

= anduj = Af{a). 

Of course, this theorem entails that the entanglement-assisted classical capacity of quantum 
channels with side information at the transmitter is 

C = sup {I{A;B)^(^„)-I{A;S),} (5) 

via super-dense coding. 



Theorem 1. The entanglement-assisted quantum capacity of a quantum 
the transmitter {M, 1-0)) is 

C = suv[\l{A-B)^-\l{A-S)„ 



^2 ' ' ' 2 

The supremum is taken over all mixed states of the form a"^"^'^ where 



3 The FQSW theorem 

Before presenting our protocol, we first give a quick overview of the fully quantum Slepian-Wolf 
protocol [ADHW06]. Suppose Alice and Bob hold a mixed state p"^^. We introduce a reference 
system R to purify the state; the resulting state is l^p)^^^. Alice would like to transfer her state to 
Bob with very high fidelity by sending him as few qubits as possible. The FQSW theorem states 
that Alice can do this by first applying a unitary transformation to her entire share of the state (a 
random unitary selected according to the Haar measure will work with high probability), splitting 
her share into two subsystems A and A, and then sending A to Bob. 

Note that this scheme works provided that the subsystems A and R are in a product state after 
applying the random unitary: since Bob holds the purifying system of AR, there exists a local 
unitary that Bob can apply to turn his purifying system into separate purifying systems of the two 
subsystems. The purifying system of R is exactly the original state that Alice wanted to send to 
Bob together with the share Bob originally had, and A together with its purifying system is an 
EPR pair shared by Alice and Bob. This last feature is an added bonus of the protocol: Alice and 
Bob get some free entanglement at the end. 

It is possible to calculate how close A and R are to being in a product state. The result of the 
calculation is the following (see [ADHW06] for details): 



U(A) 



R 



\AP 



(6) 
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where Par{U) = Tr^[C/ • i/j"^^]. Since the inequality holds for the average over choices of U, there 
must exist at least one U that satisfies it. 

A special case of interest is when the initial state is an i.i.d. state of the form {\ip)'^^^)^'^. In 
this case, it can be shown that as long as log \ A\ ^ n[^I{A; R) + 6], it will be true that 

V^^^"" -(a) ^ ® (7) 

where (p^^^®"^®" is the result of applying the random unitary to Ha • (^^^^)®", where Ha is the 
projector onto the typical subspace of the A subsystem, as defined in Appendix B, and 5 > 0. 

Note that it is also possible to show that the value p^^{U) — jjj "X" V'^ is exponentially con- 
centrated around its mean value. Hence, by the union bound, given a constant number of equa- 
tions of the form (7) in which the random unitary is applied to the same system, for n large enough 
there must exist a single family of random unitaries that satisfies all of them at the same time. We 
will make use of this fact in the direct coding theorem. 

4 Direct coding theorem 

The direct coding theorem is very similar to the one in [DHL07]. We start out with n copies 
of ^ from theorem 1 and construct the nth term in a sequence of codes achieving a rate of 
|[/(^; i?)Ar(cr) — S)a\- It will be convenient for us to purify a by introducing an additional 
subsystem D. 

The way to derive a code from cr®" is to transform this state into one which looks like y^in 
in definition 1. To do this, we will first restrict a®'^ to its typical subspace on yl" using a family 

A^ ^A^ 

of typical projectors (see Appendix B); we then have nH{A)a^ — n6n ^ log|Atyp| ^ 

nH{A)(j + n6n, with (5„ — > as n — > oo. Then, for each n, we shall apply a random unitary on Atyp 
and split it into three subsystems: B, A, and R, with ^I{A; ED)j^(^^^ + 2n6n qubits, ^I{A; S)ct + 
2n6n qubits, and log |^typ| — log \B\ — log \A\ qubits respectively (hence log \R\ ^ ^I{A; B)j^(^„) — 
^I{A; S)„ - 5n6n). Call this random unitary [/^typ^iisi According to the FQSW theorem, with 
high probability over the choice of U, both of the following will hold: 

TiRB 

(U-Ua- a®«)«^^" i-^ ® (^^)®- (8) 

\RB\ 

{Ufp -U-Ha- ^ ® (AA(cT)^^)®" (9) 

\RA\ 

By Uhlmann's theorem, this implies that there exist families of partial isometries ^^'^•S""^^'"^0" 
and yBB^^^BBG (one W and one V for each n) and a family of pure states ^^e^d"^ such that 

{U-Ha- W ■ c^^^' ci>^^ (V;^^')®" (10) 

Here, (10) follows from the fact that the left-hand side (resp. right-hand side) of (10) is a purifica- 
tion of the left-hand side (resp. right-hand side) of (8); W is the unitary required by Uhlmann's 
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theorem to make the two sides close in trace distance. Equation (11) follows from (9) in a similar 
manner. Combining these two equations and using the fact that asymptotic equality is transitive, 
we get that 

V-U^^'-W- ($^^' ® f (V'^^')®") ® C^^"^'' (12) 

which proves the direct coding theorem. 

4.1 Entanglement cost and unassisted transmission 

One can check the rate at which this protocol consumes entanglement between the transmitter 
and the receiver. The size of B is §/(j4; ED)^(^^^ + 2n5„ qubits; however, one can see that some 

entanglement is recovered at the end of the protocol in the A and B subsystems, with log \A\ = 
^I{A; S)cr+2n6n- Hence, the net amount of entanglement consumed is ^ [l{A; ED)^(^„-^ — I{A; S)a] 
Furthermore, this allows us to calculate the transmission rate when this protocol is used to 
send qubits without preshared entanglement. In this mode, we use part of the transmission rate 
to send the entanglement needed by the protocol, and the rest of the rate to send the data we 
are actually interested in sending. We compute this rate by subtracting the entanglement cost 
from the number of transmitted qubits: log \R\ - log \B\ + log \A\ > ^I{A; B)j^(^^^ - f /(^; S)a — 
^I{A;ED)j^^„) + ^I{A;S)^-5n6n = ^I{A)B)^^^^-5n6n. We therefore have a rate of i/(^)5)^(^) 
for unassisted transmission. This expression is of the same form as for the usual unassisted quan- 
tum coding theorem, although the coherent information is defined on a slightly different state. 

5 Converse theorem 

We shall now prove that for any achievable rate Q, there exists a state cr"^^''^ as in theorem 1 for 
which Q = ^I{A; -B)Ar(cr) — ^^(^; S)a- This part is essentially the same as in [GP80], with a few 
adaptations to the quantum case. In particular, one must pay close attention to which state the 
various mutual informations are defined on, since we will be dealing with states where only some 
fraction of the n instances of the channel has been applied. 

First, let and yBB^^BBG encoding and decoding isometries for a {Q,n,e)- 

code as in definition 1, and let ifin = ® (V''^^')®", a = W ■ tpi^ and uj = Uff" ■ a. Then, 
by Fannes' inequality we must have that 

I{R; B'^B)^ ^ 2n{Q - d{e, n)) (13) 

where d{e, n) := ^ + Notice that 

/(/?; SB")^ = I{B; R)^ + B''\B)^ (14) 
= I{R;B''^\B)^ (15) 
^liRB-B"")^ (16) 

where (15) is due to the fact that R and B are independent. Combining this with I{RB; S"')^ = 0, 
we have 

I{RB; B")^ - liRB; 5"), ^ 2n{Q - d{e, n)) (17) 
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We will now introduce a few shorthands which will make the notation considerably less cum- 
bersome: we will write instead oi Bi, . . . , Bi and Bj instead oi Bi, . . . , Bj, and likewise for S. 
Define also 

X{i) := RBB'-^Sf^^ (18) 
:= RBS^^^ (19) 

Note that these are nothing more than groupings of subsystems. We also define the following 
sequence of states: 

u;(i) := (f/®^0l®"-i).a (20) 

In other words, uj{i) is the result of applying the first i instances of the channel to the state a. 
We shall now prove the inequality 

n 

I{RB;B'-)^ - I{RB;S^)a ^ {/(X(i); i?,)^,) - 5.)^(,_i) } . (21) 

j=i 

Since each term in this sum is of the form I{A; B)^(^^) — I{A;S)(j for some a"^"^'^, the highest 
term is achievable by the direct coding theorem and therefore there exists a state for which Q ^ 
I{A; B)j^(^^^ — I{A; S)a- This allows us to conclude the theorem. 

We now proceed in exactly the same way as in [GP80] to establish (21): we consider the in- 
equality 

liYii); B%,) - liYii); S%^i_,) ^ [/(y(i - 1); i?'-^)^(,_i) - I{Y{i - 1); S'-^^,.,)] 

+ [l{X{iy,B^)^^i) - I(X(i); 5.)^(,_i)] . (22) 

Summing up all these inequalities from i = 2 to i = n, we obtain (21), since Y{n) = RB and 

y(i) = x(i). 

Now, to prove (22), we use the following identities which follow from the definitions of X{i) 
and Y{i) and from basic properties of the mutual information. 



/(y(i); B%^,) = I{Y{iy,B'-Xir) + I{Y{iyBi\B'-X{€) (23) 

/(y(i);S*)^(,_i) = /(y(^);5,).(._i) +/(y(^);5-'-^|5,).(.-i) (24) 

/(y(i-l);5*-i)^(,„i) =/(y(0;5^"^|5,).(.-i) (25) 

I{Y{i - 1); i?-'-^)^(,_i) = /(5,; S*-i)^(,_i) + I{Y{i)- i?^-^|5i)^(,_i) (26) 

/(X(i);i?,)..W = I{B'~^;B,)^^~^+I{Y(i)-B,\B'~Xi,) (27) 

/(X(i); 5,)^(.-i) = /(i?'-^ 5,)^(i_i) + I{Y{i)- Si\B'~X{^~l) (28) 

Substituting these into (22) and using the identity 

I{A- B) - I{A- B\C) = I{A- C) - I{A- C\B) (29) 



which holds on any mixed state p , we get that the difference between the right-hand side and 
the left-hand side of (22) is I{B^~^; which is always normegative. This concludes the proof. 
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6 Discussion and Conclusion 



This result further strengthens the parallel between classical information theory problems and 
their entanglement-assisted quantum counterparts. Indeed, the capacity formula (4) has the same 
form as the classical version (1); the same phenomenon arises in the case of the entanglement- 
assisted capacities of regular point-to-point channels [BSST02], multiple-access channels [HDW05], 
and, for the best coding theorem we know, broadcast channels [DHL07]. A similar equivalence 
has also been shown at the level of error-correcting codes themselves [HDB07, BDH06c, BDH06b]. 
It would be particularly interesting to have a systematic way in which classical coding theorems 
could be transformed into entanglement-assisted quantum protocols as it would enable us to im- 
port much larger classes of results from classical information theory into the quantum world. 

Returning to our result, there is one remaining issue that one would like to solve in order 
to have a fully satisfactory characterization of the achievable rate region: we currently have no 
upper bound on the dimension of the A system needed to achieve the capacity in expression (4). 
Thus, despite having a single-letter converse, we unfortunately do not have a way to compute the 
capacity. In the classical case, it is possible to use Caratheodory's theorem to bound the cardinality 
of U in the optimal input distribution. However, in the quantum case, this approach fails due 
to the fact that the quantum conditional entropy cannot in general be expressed as H{A\B) = 
^^p(6)-fr(yl|i? = b). On the other hand, there is little reason to believe that large dimensions 
are necessary to achieve the optimal rate, but we have not managed to prove it. In fact, one 
encounters a very similar difficulty when trying to calculate the squashed entanglement [CW04] 
of a particular state since we have no bound on the size of the subsystem we need to condition on. 
We therefore leave this issue as an open problem. 

One might also wonder about a related problem: whether the capacity can in general be 
achieved by optimizing only over pure states ^ . This would imply an upper bound on \A\. 
However, one can show that this cannot be the case: take, for example, a qubit-to-qubit channel 
which applies one of the four Pauli operations with equal probability, but where S tells the trans- 
mitter which one of the four operations is applied. The capacity of such a channel is clearly one 
qubit per transmission. Suppose that this rate is achievable using a pure state a"^^ ^. Then, we 
must have ^I{A; -B)Ar(CT) = 1 (since B is two-dimensional) and therefore ^I{A; S)o- = 0. How- 
ever, this last equation together with the fact that a is pure implies that the purification of S must 
be entirely in A'. This is impossible since S is maximally mixed over a four-dimensional system 
whereas A' is two-dimensional, and hence the optimal a cannot be pure. 
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A Asymptotic equality 

Here we formally define the asymptotic equations involving the ^^a) relation. Let ^ = {^(i) , "0(2), ■■■ } 
and If = {'P{i),'-P(2),---} be two families of quantum states, where V'(n) arid ipi^n) are defined on a 
Hilbert space "H®". Then we say that 4^ ^(^a) f if limn^oo ||^{n) — V5{n)||i = 0. We then say that 
ip and ip are asymptotically equal. Note that, by the triangle inequality, ^(^a) is transitive for any 
finite number of steps independent of n. 
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It should be mentioned that throughout the paper, asymptotic families of states are not always 
explicitly referred to as such, but generally speaking, whenever a state depends on the number of 
copies, it should be considered as a family of states. In addition, with a slight abuse of notation, 
we allow quantum operations on families of states; it should be clear which operation is done on 
each member of the family. 



B Typical subspaces 

Much of information theory relies on the concept of typical sequences. Let X be some alphabet and 
let X he a random variable defined on X and distributed according to p{x). Define the e-typical 
set as follows: 

r,(") = |x" G ;f"| |-ilogPr{X" = x"} - H{X) \ < e} 

where X" refers to n independent, identically-distributed copies of X. It can be shown that the 
two following properties hold: 

1. There exists a function e{n) such that lim„^ooe("^) = and such that Pr{X" G ^^(")} ^ 
1 -e(n). 

2. There exists an no such that for all n > no, \t}'^''\ ^ 2'^l^(^)+^]. 

The quantum generalization of these concepts is relatively straightforward: let = J2xex Pi^^) I ^) (^1 
be the spectral decomposition of a quantum state on a quantum system A. Then we can define 
the typical projector on the quantum system A®" as follows: 

We call the support of n^"^ the e-typical subspace of (For brevity, we often omit e and refer 
simply to the typical subspace. In this case, unless otherwise stated, e can be assumed to be a 
positive constant, independent of n.) The two properties given above generalize to the quantum 
case: 



e{n)P 



1. There exists a function such that lim.„_»oo ^{n) = and such that Tr 
1 -e(n). 

2. There exists an no such that for all n > no, Tr[ni"^] ^ 2"[^(^)+^]. 

Note that the first of these two properties implies that n^"^^ • p^®" ps^^-, p^'^"', via the "gentle 

measurement" lemma (Lemma 9 in [Win99]). With some abuse of notation, we will use 11^ 
to refer to a family of typical projectors on which satisfies the two properties above. 
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